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In  a  decision  diagram,  the  average  path  length  (APL)  is  the  average  number  of  nodes  on  a  path 
from  the  root  node  to  a  terminal  node  over  all  assignments  of  values  to  variables.  Smaller  APL 
values  result  in  faster  evaluation  of  the  function  represented  by  a  decision  diagram.  For  some 
functions,  the  APL  depends  strongly  on  the  variable  order.  In  this  paper,  we  propose  an  exact 
and  a  heuristic  algorithm  to  determine  the  variable  order  that  minimizes  the  APL.  Our  exact 
algorithm  uses  branch-and-bound.  Our  heuristic  algorithm  uses  dynamic  reordering,  where 
selected  pairs  of  variables  are  swapped.  This  paper  also  proposes  an  exact  and  a  heuristic 
algorithm  to  determine  the  pairs  of  binary  variables  that  reduce  the  APL  of  multi-valued  de¬ 
cision  diagrams  (MDDs)  for  a  4- valued  input  2-valued  output  function.  Experimental  results 
show  that  the  heuristic  algorithm  is  much  faster  than  the  exact  one  but  produces  comparable 
APLs.  Both  algorithms  yield  an  improvement  over  an  existing  algorithm  in  both  APL  and 
runtime.  Experimental  results  for  2-valued  cases  and  4-valued  cases  are  shown. 

Keywords:  BDD,  MDD,  average  path  length  (APL),  node  traversing  probability,  edge  traversing  probability,  branch- 
and-bound,  sifting  algorithm. 


1  INTRODUCTION 

Binary  decision  diagrams  (BDDs)  [5]  and  multi-valued  decision  diagrams  (MDDs)  [  15]  are  extensively  used  in  logic  syn¬ 
thesis  [10],  logic  simulation  [1,  13,  17],  software  synthesis  [2,  14],  and  pass  transistor  logic  (PTL)  [3,  29,  30].  These 
applications  use  decision  diagrams  to  evaluate  logic  functions,  and  the  evaluation  time  is  proportional  to  the  average  path 
length  (APL)  in  the  decision  diagram.  Therefore,  minimization  of  the  APL  leads  to  faster  evaluation  of  the  logic  function. 
Particularly,  in  logic  simulation  using  decision  diagrams  [1,  13,  17],  minimization  of  the  APL  reduces  the  simulation  time 
substantially  because  logic  functions  are  evaluated  many  times  with  different  test  vectors. 

Minimization  of  the  APL  can  also  be  applied  to  logic  synthesis.  A  method  for  functional  decomposition  [32]  uses 
BDDs  to  detect  Boolean  divisors.  The  quality  of  a  divisor  is  measured  by  the  number  of  don’t-cares  it  provides  for  the 

1  Since  this  paper  is  reformatted  to  A4  size,  page  numbers  differ  from  original  one. 


437 


Report  Documentation  Page 

Form  Approved 

OMB  No.  0704-0188 

Public  reporting  burden  for  the  collection  of  information  is  estimated  to  average  1  hour  per  response,  including  the  time  for  reviewing  instructions,  searching  existing  data  sources,  gathering  and 
maintaining  the  data  needed,  and  completing  and  reviewing  the  collection  of  information.  Send  comments  regarding  this  burden  estimate  or  any  other  aspect  of  this  collection  of  information, 
including  suggestions  for  reducing  this  burden,  to  Washington  Headquarters  Services,  Directorate  for  Information  Operations  and  Reports,  1215  Jefferson  Davis  Highway,  Suite  1204,  Arlington 

VA  22202-4302.  Respondents  should  be  aware  that  notwithstanding  any  other  provision  of  law,  no  person  shall  be  subject  to  a  penalty  for  failing  to  comply  with  a  collection  of  information  if  it 
does  not  display  a  currently  valid  OMB  control  number. 

1.  REPORT  DATE 

2QQ5  2.  REPORT  TYPE 

3.  DATES  COVERED 

4.  TITLE  AND  SUBTITLE 

Exact  and  Heuristic  Minimization  of  the  Average  Path  Length  in  Decision 
Diagrams 

5a.  CONTRACT  NUMBER 

5b.  GRANT  NUMBER 

5c.  PROGRAM  ELEMENT  NUMBER 

6.  AUTHOR(S) 

5d.  PROJECT  NUMBER 

5e.  TASK  NUMBER 

5f.  WORK  UNIT  NUMBER 

7.  PERFORMING  ORGANIZATION  NAME(S)  AND  ADDRESS(ES) 

Naval  Postgraduate  School, Department  of  Electrical  and  Computer 
Engineering, Monterey, CA, 93943 

8.  PERFORMING  ORGANIZATION 

REPORT  NUMBER 

9.  SPONSORING/MONITORING  AGENCY  NAME(S)  AND  ADDRESS(ES) 

10.  SPONSOR/MONITOR'S  ACRONYM(S) 

11.  SPONSOR/MONITOR'S  REPORT 
NUMBER(S) 

12.  DISTRIBUTION/AVAILABILITY  STATEMENT 

Approved  for  public  release;  distribution  unlimited. 

13.  SUPPLEMENTARY  NOTES 

14.  ABSTRACT 

15.  SUBJECT  TERMS 

16.  SECURITY  CLASSIFICATION  OF:  17.  LIMITATION  OF 

18.  NUMBER  19a.  NAME  OF 

a.  REPORT  b.  ABSTRACT  c.  THIS  PAGE 

unclassified  unclassified  unclassified 

21 

Standard  Form  298  (Rev.  8-98) 

Prescribed  by  ANSI  Std  Z39-18 


438 


Nagayama,  et  at. 


minimization  of  the  quotient.  The  don’t-cares  are  generated  by  the  paths  in  the  BDD  that  lead  to  the  terminal  nodes.  The 
shorter  the  paths,  the  more  don’t-care  minterms  they  contain.  Therefore,  minimizing  the  APL  in  BDDs  can  improve  the 
quality  of  decomposition. 

In  pass  transistor  logic  (PTL)  synthesis,  the  circuits  are  derived  directly  from  BDDs  representing  logic  functions.  In  this 
case,  the  longer  paths  in  BDDs  cause  larger  voltage  drop  and  larger  delay.  This  problem  can  be  solved  by  inserting  buffers 
in  long  paths  [3],  Minimizing  the  APL  in  the  BDD  can  reduce  the  number  of  buffers  that  must  be  inserted. 

In  this  paper,  we  propose  an  exact  APL  minimization  algorithm  based  on  the  branch-and-bound  algorithm.  This  algo¬ 
rithm  finds  an  optimum  variable  order  much  faster  than  exhaustive  search,  which  enumerates  all  possible  variable  orders. 
However,  the  exact  method  is  time-consuming  for  functions  with  many  inputs.  To  minimize  the  APL  of  such  functions  in  a 
reasonable  time,  we  propose  a  heuristic  algorithm  based  on  dynamic  variable  reordering. 

This  paper  is  organized  as  follow.  Section  2  contains  the  necessary  terminology  and  definitions.  Section  3  introduces 
lower  bounds  on  the  APL.  Section  4  proposes  an  exact  and  a  heuristic  minimization  algorithm  for  the  APL.  Section  5  con¬ 
siders  the  paired  ordering  of  binary  variables.  Section  6  shows  the  efficiency  of  the  algorithms  using  benchmark  functions. 
Experimental  results  for  2-valued  cases  and  4- valued  cases  are  shown.  The  Appendix  includes  the  proofs  of  theorems. 


2  PRELIMINARIES 

We  assume  that  the  reader  is  familiar  with  the  basic  terminology  of  reduced  ordered  binary  decision  diagrams  (ROBDDs)  [5] 
and  reduced  ordered  multi-valued  decision  diagrams  (ROMDDs)  [15],  In  the  following,  a  BDD  and  an  MDD  mean  an 
ROBDD  and  an  ROMDD.  DD  means  either  BDD  or  MDD. 

Definition  2.1  Let  x  be  an  r-valued  variable,  and  let  c  £  {0,1 _ _  r  —  1}.  Then,  P(x  =  c)  denotes  the  probability  that  x  has 

value  c. 

Definition  2.2  In  a  DD,  a  sequence  of  edges  and  non-terminal  nodes  leading  from  the  root  node  to  a  terminal  node  is  a 
path.  The  number  of  edges  in  the  path  is  the  path  length. 

Note  that  the  sequence  of  edges  in  a  path  /?,  of  a  DD  corresponds  to  an  assignment  of  values  a,  to  the  specific  variables 
associated  with  those  edges  in  the  DD.  We  say  that  such  an  assignment  a,-  selects  path  Similarly,  if  an  assignment  of 
values  Ci  to  all  variables  agrees  with  a ,•  for  all  variables  assigned  in  a,-,  we  also  say  Cj  selects  path 

Definition  2.3  In  a  DD  for  an  n-variable  function,  the  path  probability  of  a  path  pi,  denoted  by  PP(pi),  is  the  probability 
that  the  path  pt  is  selected  in  all  assignments  of  values  to  the  r-valued  variables.  PP(pi)  is  given  by 

PP(Pi)  =  X  P(X1  =  c0  X  P(x 2  =  C2)  x  . . .  X  p(xn  =  Cn), 
ceQ 

where  Cj  denotes  a  set  of  assignments  of  values  to  the  variables  selecting  the  path  pj,  c  =  (ci,C2,...,c„),  each  cj  £ 
{0, 1, . . .  ,r—  1},  and  P{xj  =  cf)  is  the  probability  xj  has  value  Cj. 

Definition  2.4  The  average  path  length,  or  APL,  in  a  DD  is  given  by: 

N 

APL=JjPP(pl)xll, 
i= 1 

where  i  indexes  the  paths,  N  denotes  the  number  of  paths,  and  /,-  denotes  the  path  length  of  path  p\. 

Definition  2.5  The  node  traversing  probability  of  a  node  v,  denoted  by  NTP(y),  is  the  probability  that  an  assignment  of 
values  to  the  variables  selects  a  path  that  includes  the  node  v. 

Definition  2.6  The  edge  traversing  probability  of  an  edge  e,  denoted  by  ETP{e),  is  the  probability  that  an  assignment  of 
values  to  the  variables  selects  a  path  that  includes  the  edge  e. 

Note  that  the  node  traversing  probability  of  the  root  node  in  a  decision  diagram  is  1.0,  since  all  paths  start  from  the  root 
node. 
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Figure  1 :  Example  of  node  traversing  probability  in  a  BDD. 


Lemma  2.1  [27]  The  node  traversing  probability  of  node  v  is  the  sum  of  the  edge  traversing  probabilities  of  all  incoming 
edges  to  v.  Also,  the  node  traversing  probability  of  node  v  is  the  sum  of  the  edge  traversing  probabilities  of  all  outgoing 
edges  from  v. 

Proof.  See  Appendix. 

From  Lemma  2. 1 ,  the  following  relation  holds: 

ETP(e)  =  P{x  =  c)x  NTP(v), 

where  P(x  =  c)  is  the  probability  x  has  a  value  c,  v  is  a  node  representing  a  variable  x,  and  e  is  an  outgoing  edge  correspond¬ 
ing  to  the  value  c  of  v. 

Theorem  2.1  [27]  The  APL  is  equal  to  the  sum  of  the  edge  traversing  probabilities  of  all  edges.  Also,  the  APL  is  equal  to 
the  sum  of  the  node  traversing  probabilities  of  all  the  non-terminal  nodes. 

Proof.  See  Appendix. 

From  Theorem  2.1,  we  have  the  following: 

Ne  Nv 

APL  =  j,ETP{ei)  =  X  NTP(vj), 

i=  1  j=  1 

where  Ne  and  Nv  denote  the  number  of  edges  and  non-terminal  nodes,  respectively. 

Example  2.1  Consider  the  BDD  in  Fig.  1(a),  where  solid  lines  and  dotted  lines  denote  1 -edges  and  O-edges,  respectively. 
For  simplicity,  assume  that  P(xi  =  0)  =  P(x,-  =  1)  =  0.50  (i  =  1,2, 3, 4).  This  BDD  has  10  different  paths:  path  pi  is 
(vi,ei,v2,e3),  path  p2  is  (vi  ,e\ ,  v2le^,  V4,e7),  . . .,  and  path  pw,  is  (vi,e2,  ^3,^5,  V5,^io)-  The  PP(Pi)  and  path  length  of  each 
path  pi  are  listed  in  Fig.  1(b).  Therefore,  by  Definition  2.4, 

to 

APL=^PP(Pi)xli=  3.125. 

;=i 

By  using  node  traversing  probabilities,  we  can  compute  this  APL  as  follows:  First,  we  have  NTP(yi)  =  1.00  for  root  node 
vi.  Then,  NTP(v2)  =  ETP(ei)  =  P(x\  =  0)  x  NTP(v i)  =  0.50  andNTP(v3)  =  ETP(e2)  =  P(x i  =  1)  x  NTP(vi)  =  0.50. 
Similarly, 


NTP(yf)  =  P(x 2  =  1)  x  NTP{v2 )  +  P(x2  =  0)  x  NTP(v3 )  =  0.50, 

NTP(v$)  =  P(x2  =  1)  x  NTP(v3)  =  0.25,  and 

NTP(v6 )  =  P(x3  =  1)  x  NTP(vf)  +  P(x3  =  0)  x  NTP(v5 )  =  0.375. 
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Cut(i) 


Figure  2:  Partition  of  MDD. 


Thus,  we  obtain 

6 

APL=  "YjNT  P(vj)  =  3.125. 

i=  1 

Similarly,  we  can  compute  the  APL  using  the  edge  traversing  probabilities.  ( End  of  Example) 

Consider  a  multiple-output  function  F  =  (/o,/i ,  •  ••  ,/m-i):  Rn  — ►  Rm,  where  =  {0, 1, . . .  ,r  —  1},  and  n  and  m  denote 
the  number  of  input  and  output  variables,  respectively.  In  this  paper,  we  use  shared  MDDs  (SMDDs)  to  represent  multiple- 
output  function  F.  For  reasons  that  will  be  clear  later,  we  view  the  APL  of  an  SMDD  as  the  sum  of  the  APLs  of  the 
individual  MDDs  for  each  component  function  f. 

3  LOWER  BOUNDS  ON  APL 

In  this  section,  we  derive  lower  bounds  on  the  APL.  Such  bounds  are  used  to  reduce  the  computation  time  in  the  algorithm, 
as  discussed  later. 

Definition  3.1  Suppose  an  MDD  is  partitioned  into  two  parts  as  shown  in  Fig.  2.  Here,  Xupper  denotes  the  variables  above 
or  in  level  i,  Xiower  denotes  the  variables  below  or  in  level  i  +  1,  and  Cut(i)  denotes  a  set  of  edges  connecting  the  nodes 
above  or  in  level  i  with  the  nodes  below  or  in  level  i+  L 

Note  that  the  nodes  are  indexed  by  i  starting  with  the  root  node  at  level  1.  The  nodes  just  below  have  i  =  2,  etc.. 

Definition  3.2  ETP(Cut(i))  denotes  the  sum  of  edge  traversing  probabilities  of  edges  in  Cut{i),  and  is  given  by 

ETP{Cut(i))  =  £  ETP(e). 

e£Cut(i ) 

Lemma  3.1  Suppose  an  SMDD  represents  a  multiple-output  function  F  =  (/o,/i , . . .  ,fm- 1).  Then, 

ETP(Cut(i))  =  mu, 

where  mu  is  the  number  of  the  root  nodes  of  the  multiple-output  function  F  above  or  in  level  i. 

Proof.  See  Appendix. 

Corollary  3.1  Suppose  an  MDD  represents  a  single-output  function  f.  Then, 

ETP(Cut(i))  =  1.0. 

Lemma  3.2  Let 

Cut' (t)  =  {e  |  e  £  Cut(i),  such  that  e  is  incident  to  only  non-terminal  nodes}. 

Then,  for  every  permutation  ofXupper, 


where  Ci  <  mu- 


ETP(Cut'(i))  =  a , 
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Proof.  See  Appendix. 

Theorem  3.1  Consider  an  SMDD  for  multiple-output  function  F.  Let  L  be  the  sum  of  the  node  traversing  probabilities  of 
the  non-terminal  nodes  below  or  in  level  i+  1.  Let  m l  be  the  number  of  root  nodes  for  F  below  or  in  level  i  +  1.  Then,  for 
any  permutation  ofXiower  and  any  permutation  ofXupper, 

ET P(Cut'(i ))  +  mi  <  L. 

Proof.  See  Appendix. 

Theorem  3.2  Consider  an  SMDD  for  multiple-output  function  F.  Let  U  be  the  sum  of  the  node  traversing  probabilities  of 
the  non-terminal  nodes  above  or  in  level  i.  When  the  order  ofXupper  is  fixed, 

U  +  ETP(Cut'(i))  +  mL  <  APL. 

Proof.  See  Appendix. 

Corollary  3.2  Consider  an  SMDD  of  multiple-output  function  F.  Let  U  and  L  be  the  sums  of  the  node  traversing  probabil¬ 
ities  of  the  non-terminal  nodes  above  and  below  or  in  level  i,  respectively.  Then 

max{L,  U }  <  APL. 


4  MINIMIZATION  OF  APL 

Since  the  APL  in  a  DD  (BDD  or  MDD)  depends  on  the  variable  order,  the  APL  minimization  problem  can  be  formulated 
as  follows: 

Problem  4.1  Given  a  DD  for  a  logic  function  f,  find  a  variable  order  that  produces  the  minimum  APL. 

4.1  Change  of  the  APL  during  Swapping  Two  Adjacent  Variables 

Our  APL  minimization  algorithms  go  from  one  variable  order  to  another  variable  order  by  a  sequence  of  steps  that  swap 
pairs  of  adjacent  variables.  A  part  of  the  algorithms  that  has  a  significant  effect  on  computation  time  is  updating  the  APL 
after  swapping  each  pair  of  adjacent  variables.  This  section  describes  a  fast  method  to  update  the  APL  after  the  swap  of  two 
adjacent  variables. 

Theorem  4.1  Let  U  be  the  sum  of  the  node  traversing  probabilities  of  non-terminal  nodes  above  or  in  level  i—  1,  and  let  L 
be  the  sum  of  the  node  traversing  probabilities  of  non-terminal  nodes  below  or  in  level  i  +  2.  Then,  after  the  variable  swap 
of  level  i  with  level  i+  1,  U  and  L  remain  unchanged. 

Proof.  See  Appendix. 

Theorem  4. 1  shows  that  the  previously  computed  node  traversing  probabilities  need  not  be  repeated  in  computing  the 
new  APL  caused  by  the  swap  of  two  adjacent  variables.  Fig.  3  illustrates  a  subgraph  of  level  i  and  level  i  +  1  in  the  BDD 
when  two  adjacent  variables  are  interchanged.  Since  the  principles  of  variable  swap  for  the  binary  case  and  the  multi-valued 
case  are  the  same,  we  describe  only  the  binary  case.  The  details  of  variable  swaps  for  the  multi-valued  case  are  discussed  in 
[18].  A  subgraph  composed  of  BDD  nodes  involved  in  the  variable  swap  belongs  to  one  of  the  six  classes  shown  in  Fig.  3. 
For  each  class,  the  figure  on  the  left  occurs  before  the  swap,  while  the  figure  on  the  right  occurs  as  a  result  of  the  swap. 
In  Fig.  3,  only  cases  (e)  and  (f)  do  not  change  the  APL,  while  other  cases  change  the  APL.  For  example,  in  case  (a),  the 
node  traversing  probabilities  of  nodes  V2  and  V3  are  changed  as  a  result  of  the  swap.  Before  the  swap,  the  node  traversing 
probabilities  of  vt  and  V3  are  given  by: 

NTP(v2 )  =  ETP{e 0)  =  P(xi  =  0)  x  NTP{v  1 ) 

NTP(v3)  =  ETP{ei)  =  P(xi  =  1)  x  NTP(v  1), 

where  eo  and  e\  denote  the  edges  from  vi  to  V2  and  from  vi  to  V3,  respectively.  On  the  other  hand,  after  the  swap,  the  node 
traversing  probabilities  of  V2  and  V3  are: 


NTP{v  2) 
NTP[v  3) 


P{xi+i  =  0)  x  NTP(v\) 
P(xi+l  =  l)xNTP(vi). 
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Figure  3:  Six  cases  of  exchanging  two  adjacent  variables. 


When  P(xi  =  0)  =  P(x,-+i  =  0)  and  P(x,-  =  1)  =  P(x;+ 1  =  1),  the  node  traversing  probabilities  of  vt  and  V3  do  not  change 
after  the  swap.  Therefore,  in  case  (a),  the  APL  is  changed  by  the  edge  traversing  probabilities  of  outgoing  edges  from  vi. 
Similarly,  in  other  cases  except  for  (e)  and  (f),  the  APL  is  changed  by  the  edge  traversing  probabilities  of  outgoing  edges 
from  the  root  node  of  a  subgraph.  Note  that  from  Theorem  4.1,  we  consider  only  the  edges  from  the  root  node  to  nodes  in 
level  i  +  1  to  update  the  APL. 

We  summarize  the  strategy  for  updating  the  APL  as  follows: 

1 .  Before  the  swap,  for  each  subgraph  involved  in  the  swap,  the  edge  traversing  probabilities  of  the  edges  from  the  root 
node  of  a  subgraph  to  nodes  in  level  i  +  1  are  subtracted  from  1)  the  APL  and  from  2)  the  node  traversing  probabilities 
of  nodes  in  level  i  +  1. 

2.  After  the  swap,  for  each  subgraph,  the  edge  traversing  probabilities  of  edges  from  the  root  node  of  a  subgraph  to 
nodes  in  level  i  +  1  are  re-calculated. 

3.  The  calculated  edge  traversing  probabilities  are  added  to  1)  the  APL  and  to  2)  the  node  traversing  probabilities  of 
nodes  in  level  i  +  1. 

Example  4.1  Fig.  4  shows  BDDs  for  logic  function  f  =  X1X4  VX2X4  VX3.  Fig.  4(a)  shows  the  BDD  with  the  variable  order 
(xi,X2,X3,X4),  top  to  bottom.  For  simplicity,  assume  that  P(x,  =  0)  =  P(x,-  =  1)  =  0.50  (i  =  1,2, 3, 4).  Then,  the  APL  of  the 
BDD  in  Fig.  4(a)  is  2.875.  In  this  BDD,  we  consider  the  swap  of  variables  X2  and  X3.  During  such  a  swap,  case  (b)  applies 
to  node  V2  and  case  (f)  applies  to  node  V4.  Performing  the  swap  leads  to  the  BDD  shown  in  Fig.  4(b).  Note  that  the  swap 
decreases  the  APL  by  0.25  because  the  node  V4  after  the  swap  does  not  have  the  incoming  edge  from  node  V2.  The  node 
traversing  probabilities  associated  with  nodes  V2  and  V3  do  not  change.  The  overall  APL  decreases  from  2.875  to  2.625. 

(End  of  Example) 

Example  4.2  Fig.  5(a)  shows  the  BDD  with  the  variable  order  (x2 ,  X3 ,  x  1 )  for  logic  function  f  =  xi(x2  VX3).  Assume  that 

P(x\  =0)  =  0.6,  P(x\  =  1)  =  0.4, 

P{x 2  =  0)  =  0.3,  P(x2  =  1)  =  0.7, 

P(x3  =  0)  =  0.8,  P(x3  =  1)  =  0.2. 

The  APL  of  the  BDD  in  Fig.  5(a)  is  2.06.  For  the  swap  of  variables  X3  and  xi,  case  (d)  applies  to  node  V2  and  case  (f) 
applies  to  node  V3.  Performing  this  swap  yields  the  BDD  shown  in  Fig.  5(b).  It  changes  the  node  traversing  probabilities  of 
V3  and  V4  (a  new  node).  Before  the  swap,  the  edge  traversing  probability  of  edge  from  V2  to  V3,  0.06,  is  subtracted  from  the 
APL  and  from  the  node  traversing  probabilities  of  v  3.  After  the  swap,  the  edge  traversing  probability  of  edge  from  V2  to  V4, 
0. 12,  is  added  to  the  APL  and  to  V4.  The  overall  APL  increases  from  2.06  to  2. 12.  (End  of  Example) 
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Figure  4:  Example  of  the  update  of  the  APL 


/  / 


(a)  (b) 


Figure  5 :  Another  example  of  the  update  of  the  APL 


4.2  Symmetric  Variables 

Definition  4.1  A  logic  function  f(x  i,X2, . . . ,  x, . . . . .  xj , . . .  ,xn )  is  symmetric  with  respect  to  x,  and  xj  if  the  interchange  of 
Xi  and  Xj  does  not  change  f.  x,  and  xj  are  called  symmetric  variables. 

In  a  DD,  swapping  symmetric  variables  x,  and  xj  does  not  change  the  graph  structure. 

Definition  4.2  Let  Jt]  and  K2  be  permutations  of  the  variables.  If  the  positions  of  variables  in  7t]  are  the  same  as  in  %2 
except  for  symmetric  variables ,  7ti  and  %2  are  called  symmetric  orders. 

Since  symmetric  orders  produce  DDs  with  the  same  graph  structure,  the  DDs  have  the  same  APL  when  P(xj  =  0)  = 
P(xj  =  0),  P(xj  =  1)  =  P(xj  =  1),  . . .,  and  P(xi  =  r  —  1)  =  P(xj  =  r  —  1)  for  symmetric  variables  x\  and  xj.  Therefore,  in 
such  a  case,  detection  of  symmetric  orders  can  reduce  the  computation  time  for  an  APL  minimization  algorithm. 

Example  4.3  Consider  the  logic  function  f  =  x\x^\J  X2X^\/ Xj,  (Fig.  4).  Let  variable  orders  K\  andn2  be  (xi,X2,X3,X4)  and 
(x2,xi,X2,X4),  respectively.  Since x i  andx 2  are  symmetric  variables.  7ti  and %2  are  symmetric  orders.  The  BDDsfor  the  two 
orders  are  the  same  except  the  labels  X\  and  x 2  are  interchanged ,  and  have  the  same  APL  and  the  same  number  of  nodes. 

(End  of  Example) 
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1 

minimize_APL  (DD,  input  variables  X,  #  inputs  n)  { 

2 

^SLlb  =  4*  ‘ 

3 

cost[Xiuh]  —  0  , 

4 

order  [AJuZ)]  =  §  ; 

5 

Snext  =  { Xsub }  j 

6 

min_apl  =  APL  for  initial  DD  ; 

7 

for  (level  =  1;  level  <  n;  level++ )  { 

8 

Scur  —  Snext  i 

9 

Snext  =  4*  ’ 

10 

for  (each  Xsub  €  Scur)  { 

11 

ordering(DD,  orderfA^],  level  -  1) ; 

12 

for  (each  x,-  €  {X  \  AluZ)})  { 

13 

X sub  =  ^sub  U  {x,  }  ; 

14 

Move  Xj  to  level  ; 

15 

symmetry _check(DD,  level)  ; 

16 

if  (order[A'Mfo]  and  current  order  are  symmetric  &&  all  P(x  =  c)s  are  same) 

17 

continue  ; 

18 

Update  min_apl ; 

19 

if  (lower_bound(/ev<?/)  >  min_apl) 

20 

continue  ; 

21 

new_cost  =  cost[AJui]  +  NTP(level)  ; 
if  (new .cost  <  cost[A'HJ)  { 

22 

23 

cost[X'!(/J  =  new.cost ; 

24 

order  [X'mb]  =  current  order  ; 

25 

^  C X'sub  Snext) 

26 

Snext  =  Snext  U  {  Ssuj)  }  ; 

27 

} 

28 

} 

29 

} 

30 

} 

31 

ordering(DD,  ordeijA],  n)  ; 

32 

} 

Figure  6:  Exact  APL  minimization  algorithm. 


4.3  Exact  Minimization  Algorithm 

Fig.  6  shows  a  pseudo-code  to  solve  Problem  4.1.  This  algorithm  finds  an  optimum  solution  using  a  branch-and-bound 
method,  similar  to  the  top-down  algorithm  (JANUS)  in  [9].  JANUS  [9]  uses  the  number  of  nodes  in  a  BDD  as  the  cost 
function,  while  our  algorithm  uses  the  APL  of  a  DD  (BDD  or  MDD)  as  the  cost  function.  By  using  the  node  traversing 
probability  (NTP),  the  changes  in  APL  can  be  calculated  at  each  node  locally.  This  locality  of  computation  allows  a  top- 
down  algorithm.  To  our  knowledge,  this  is  the  first  time  an  APL  minimization  algorithm  based  on  branch-and-bound  has 
been  proposed.  This  algorithm  finds  an  optimum  variable  order  much  faster  than  the  exhaustive  search  method,  which 
enumerates  all  possible  variable  orders.  In  lines  1 1  and  3 1  of  Fig.  6,  procedure  ordering  changes  the  variable  order  of  the 
DD  into  the  given  order  from  the  top  to  the  specified  level.  For  example,  let  the  current  variable  order  be  (x  i ,  X2 ,  X3 ,  X4 ,  X5) . 
We  seek  the  order  (x5,xzt)  at  level  two.  That  is,  we  seek  (x5,X4,  *,  *,  *),  where  “*,*,*”  represents xi,  X2,  andx3  in  some  order. 
Then,  procedure  ordering(DD,  (x5,X4),  2)  obtains  the  order  (x5,X4,xi,X2,X3)  in  7  swaps  from  the  order  (xi,X2,X3,X4,X5). 
Procedure  symmetry  .check  in  line  15  checks  symmetry  of  adjacent  variables  [22].  When  the  variable  order  of  X' . ,  which 
has  already  been  stored  in  array  “order[X'  .]”  as  a  candidate,  and  the  current  variable  order  of  the  DD  are  symmetric,  and 
all  P(x  =  c)s  are  same  for  the  symmetric  variables,  the  current  order  is  excluded  from  the  set  of  candidates.  In  line  19, 
Theorem  3.2  is  used  to  eliminate  the  unneeded  variable  exchanges  to  reduce  computation  time.  In  line  21,  NTP(level) 
denotes  the  sum  of  the  node  traversing  probabilities  of  the  nodes  on  the  given  level  (level).  The  initial  values  of  array  cost 
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1 

sifting_APL  (DD,  #rounds  of  sifting  R)  { 

2 

cost  =  APL  for  initial  DD  ; 

3 

for  (r  =  0;  r  <  R\  /•++)  { 

4 

for  (each  x;-  £l){ 

5 

start  =  current  position  of  x,  ; 

6 

best_p  =  start ; 

7 

for  (each  position  p  from  start  to  the  closer  extreme)  { 

8 

Move  x/  to  p  ; 

9 

Update  U  (or  L ) ; 

10 

if  ( cost  <  U  (or  L)) 

11 

break  ; 

12 

if  (APL  <  cost )  { 

13 

cost  =  APL ; 

14 

best  _p  =  p  ; 

15 

} 

16 

} 

17 

for  (each  position  p  to  the  other  extreme)  { 

18 

Move  Xi  to  p  ; 

19 

Update  U  (or  L) ; 

20 

if  ( cost  <  U  (or  L)) 

21 

break  ; 

22 

if  (APL  <  cost )  { 

23 

cost  =  APL  ; 

24 

ST 

II 

^3 

25 

} 

26 

} 

27 

Move  X,  to  best  -p  ; 

28 

} 

29 

} 

30 

} 

Figure  7 :  Heuristic  APL  minimization  algorithm. 


in  Fig.  6  are  set  to  infinity. 

4.4  Heuristic  Minimization  Algorithm 

The  exact  minimization  algorithm  in  Fig.  6  obtains  an  optimum  solution  for  Problem  4.1.  However,  when  the  number  of 
input  variables  is  large,  finding  the  optimum  variable  order  may  require  much  computation  time. 

In  this  section,  we  show  a  heuristic  minimization  method  using  variable  sifting  [23].  The  sifting  algorithm  repeatedly 
performs  the  following  basic  steps: 

1 .  Change  the  variable  order. 

2.  Compute  a  cost. 

The  proposed  sifting  algorithm  uses  APL  as  the  cost  function.  It  was  shown  in  Section  4. 1  that  the  APL  can  be  efficiently 
updated  after  the  swap  of  two  adjacent  variables.  As  a  result,  the  time  needed  to  compute  the  cost  in  our  sifting  algorithm 
is  comparable  to  the  time  needed  to  update  the  number  of  nodes  in  the  classical  sifting  algorithm,  which  minimizes  the 
number  of  nodes.  Fig.  7  shows  the  pseudo-code  of  the  heuristic  minimization  algorithm.  In  this  algorithm,  each  variable  x(- 
is  sifted  across  all  possible  positions  to  determine  its  best  position.  First,  x,  is  sifted  in  one  direction  to  the  closer  extreme 
(top  or  bottom).  Then,  x,  is  sifted  in  the  opposite  direction  to  the  other  extreme.  In  lines  10  and  20  of  Fig.  7,  Corollary  3.2 
is  used  to  eliminate  unneeded  sifting  of  x,  .  When  variable  x;-  moves  down  to  the  bottom,  we  use  U  equal  to  the  sum  of  the 
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node  traversing  probabilities  of  the  nodes  above  x,-.  If  cost  <  U ,  sifting  of  x,  further  down  to  the  bottom  cannot  lead  to  a 
smaller  APL  than  cost.  In  such  cases,  there  is  no  need  to  continue  sifting  to  the  bottom.  Similarly,  when  variable  x,  moves 
up  to  the  top,  we  use  L  equal  to  the  sum  of  the  node  traversing  probabilities  of  the  nodes  below  x,.  This  lower  bound  for  the 
APL  is  similar  to  the  one  introduced  for  the  number  of  nodes  during  the  classical  sifting  [8]. 


4.5  Initial  Ordering  of  the  Binary  Variables 

The  initial  ordering  of  variables  influences  the  effectiveness  of  the  heuristic  minimization  algorithm  described  in  the  previous 
section.  An  analysis  of  variable  orders  that  produces  the  minimal  APL  in  several  known  classes  of  functions  [6,  28]  leads  to 
a  heuristic  to  find  a  good  initial  variable  order.  In  this  section,  we  propose  an  initial  variable  order  using  Walsh  spectrum  [  1 2] 
for  binary  logic  functions. 

The  value  of  a  first-order  Walsh  spectral  coefficient  expresses  the  correlation  between  the  variable  value  with  the  function 
value.  For  n-variablc  logic  function  f(X),  the  first-order  Walsh  spectral  coefficient  can  be  computed  as  follows  [7]: 


where  |x,  ®  f\  denotes  the  number  of  assignments  of  values  to  the  variables  X  such  that  the  values  of  x,  and  f(X )  are  equal. 
The  initial  variable  order  is  found  by  placing  the  variables  in  descending  order  of  the  absolute  value  of  Rj.  For  variables 
with  identical  absolute  values  of  Rj,  we  arbitrarily  choose  the  order. 

All  spectral  coefficients  can  be  computed  by  scanning  the  nodes  beginning  at  the  root  node  and  ending  on  the  terminal 
nodes  using  a  fast  algorithm  [31].  The  first-order  coefficients  can  be  computed  by  a  simplified  version  of  the  general 
algorithm. 

Example  4.4  Consider  the  logic  function  f  =  X1X4  V  X2X4  V  X3  in  Example  4.1.  For  each  binary  variable  X/,  the  value  of 
\xj  ©  / 1  is  given  by: 

|*i®/|=9,  1*20/1=9,  |*3©/|  =  13,  |*40/|  =  11. 

The  value  of  each  Rj  corresponding  to  Xj  is  as  follows: 

115  3 

*1=8’  *2=8’  *3=8’  *4=8' 

Therefore,  we  have  an  initial  variable  order  xj,X4,X\  ,X2,  and  A  PL  =  1 .875.  This  is  the  minimum  APL  for  f.  (End  of  Example) 

5  PAIRED  ORDERING  OF  BINARY  VARIABLES 

Unfortunately,  there  are  no  standard  benchmark  functions  for  multi-valued  logic.  Thus,  4-valued  input  2-valued  output 
functions  obtained  by  pairing  binary  variables  of  2-valued  benchmark  functions  are  often  used  for  experiments  for  the 
multi-valued  case  [11,  18,  24],  Especially,  [11,  24]  show  that  4-valued  MDDs  can  represent  binary  logic  functions  more 
compactly  than  BDDs,  by  considering  paired  orderings  of  binary  variables.  4-valued  MDDs  can  be  implemented  efficiently 
using  LUT-based  FPGAs.  In  Section  6.2,  we  also  use  the  4-valued  input  2-valued  output  functions  for  experiments  of  the 
multi-valued  case,  and  show  that  4-valued  MDDs  can  reduce  the  APL,  as  well  as  the  number  of  nodes  efficiently.  To  do 
this,  in  this  section,  we  define  a  paired  ordering  of  binary  variables. 

Definition  5.1  Let  f(X )  be  a  2-valued  logic  function,  where  X  =  (xi,X2, . . .  ,x„)  is  an  ordered  set  of  binary  variables. 
Let  {X}  denote  the  unordered  set  of  variables  in  X.  Let  Xj  C  X.  If  {X}  =  {Xi}  U  {X2}  U  . . .  U  {Xu},  {X,}  0  {X/}  =  ([> 
(i  ^  j),  and  \Xj\  =  2,  then  (Xi,Xj, . . .  ,XU)  is  a  paired  ordering  of  binary  variables  X,  and  each  Xj  can  be  represented 
as  a  4-valued  variable.  And  then,  a  2-valued  logic  function  f(X)  can  be  represented  by  the  mapping  f(X  i,X2,...,Xu): 
{0, 1,2,3}“  — >  {0, 1}. 

For  n-variable  functions,  if  n  <  2 u  (i.e.  n  is  an  odd  number),  we  use  an  additional  redundant  binary  variable  called  a 
dummy  variable.  The  set  of  binary  variables  with  the  dummy  variable  if  it  exists,  is  denoted  by  {Xr\  =  {xi  ,X2, . . .  ,x„,x„+i}, 
where  |X'|  =n+  1  =  2 u.  Note  that  /  is  independent  of  x„+i. 
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Theorem  5.1  The  number  of  different  paired  orderings  of  binary  variables  X  =  (xi,X2,  ■  ■  ■  ,x„)  to  consider  is 

n\ 

2"  ’ 

where  u  denotes  the  number  of  4-valued  variables  obtained  by  the  pairing,  and  is  given  by 

n 

U  =  2 

Note  that  we  assume  that  n  =  2 u.2 
Proof.  See  Appendix. 

In  [11,  24],  heuristic  paired  ordering  algorithms  for  node  minimization  have  been  proposed.  However,  in  this  paper,  we 
consider  the  paired  ordering  algorithms  for  APL  minimization.  We  formulate  the  APL  minimization  problem  considering 
the  paired  orderings  of  binary  variables  as  follows: 

Problem  5.1  Given  a  binary  logic  function  f(X),  find  a  paired  ordering  of  binary  variables  X  that  produces  an  MDD  with 
the  minimum  APL. 

5.1  Exact  Paired  Ordering  Algorithm 

Fig.  8  shows  a  pseudo-code  to  solve  Problem  5.1.  This  algorithm  finds  an  optimum  solution  for  Problem  5.1  using  branch- 
and-bound,  similar  to  the  algorithm  in  Fig.  6.  In  lines  6  and  38  of  Fig.  8,  procedure  pairing  produces  an  MDD  for  4- valued 
input  2-valued  output  function  by  making  pairs  of  binary  variables  from  the  given  variable  order.  In  line  7,  minxipl  is 
set  to  the  APL  of  MDD  obtained  by  making  pairs  of  binary  variables  from  the  variable  order  of  given  BDD.  In  line  26, 
pairing -NT  P  [level)  denotes  the  sum  of  the  node  traversing  probabilities  of  the  MDD  nodes  obtained  by  pairing  binary 
variables  in  level  and  level  +  1 . 

5.2  Heuristic  Paired  Ordering  Algorithm 

In  this  section,  we  show  a  heuristic  paired  ordering  algorithm.  The  heuristic  paired  ordering  algorithm,  called  pair-sifting 
algorithm,  consists  of  the  following  four  basic  steps: 

1.  Apply  the  sifting  algorithm  for  APL  minimization  presented  in  Section  4.4  to  the  BDD  of  the  given  binary  logic 
function. 

2.  Make  pairs  of  binary  variables  from  the  variable  order  obtained  by  the  sifting  algorithm. 

3.  Construct  an  MDD  for  the  4- valued  input  2- valued  output  function. 

4.  Apply  the  sifting  algorithm  to  the  MDD. 

This  strategy  is  similar  to  one  used  in  [24],  which  minimizes  the  number  of  nodes  in  an  MDD. 

6  EXPERIMENTAL  RESULTS 

Experiments  using  MCNC  benchmarks  were  conducted  in  the  following  environment: 

•  CPU:  Pentium4  Xeon  2.8GHz 

•  LI  Cache:  32KB 

•  L2  Cache:  512KB 

•  Main  Memory:  4GB 

•  Operating  System:  redhat  (Linux  7.3) 

•  C-Compiler:  gcc  -02 

In  this  section,  we  assume  that  P[xi  =  0)  =  P[xi  =  1)  =  . . .  =  P[xi  =  r  —  1)  =  i  for  r- valued  input  functions. 

2When  n  is  an  odd  number,  we  use  a  dummy  variable. 
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1 

pairing_APL  (BDD,  binary  variables  X,  #  inputs  n)  { 

2 

NSub  =  6  ; 

3 

costpL,,*,]  =  0  ; 

4 

order  [As„fc]  =  <f)  ; 

5 

Snext  —  { ^suh  }  ' 

6 

pairing(A,  orderpf]) ; 

7 

min_apl  =  APL  for  initial  MDD  obtained  by  pairing  ; 

8 

for  ( level  =  1;  level  <  n\  level+=  2)  { 

9 

Scur  Snext  1 

10 

HfU’Xt  —  6  , 

11 

for  (each  Xsub  G  Scur)  { 

12 

ordering(BDD,  order[As„^],  level  -  1)  ; 

13 

for  (each  .*,■  G  {X \Xsllb})  { 

14 

^sub  —  X sub  U  { A;  }  ; 

15 

Move  Xj  to  level ; 

16 

symmetry _check(BDD,  level) ; 

17 

for  (each  x7  G{X\Xs'Mfc}){ 

18 

K*=K*u{xj}-’ 

19 

Move  xj  to  level  +  1  ; 

20 

symmetry _check(BDD.  level  +  1)  ; 

21 

if  (orderpf'L]  and  current  order  are  symmetric  &&  all  P(x  =  c)s  are  same) 

22 

continue  ; 

23 

Update  min_apl ; 

24 

if  (lower_bound(/eve/  +  1)  >  min_apl) 

25 

continue  ; 

26 

new  .cost  =  costfX^]  +  pairingJXT P(level)  ; 

27 

if  (new .cost  <  cost[X"(/J)  { 

28 

cost[X'[(J  =  new.cost  ; 

29 

order  [X"ub\  =  current  order  ; 

30 

if  (KubtSnex,) 

31 

Snext  —  Snex t  U  {A^}  ; 

32 

} 

33 

} 

34 

} 

35 

} 

36 

} 

37 

ordering(BDD,  order[X],  n)  ; 

38 

pairing(A,  order[A]) ; 

39 

} 

Figure  8:  Exact  paired  ordering  algorithm  for  APL  minimization. 


6.1  Binary  Case 

Table  1  compares  the  number  of  nodes  and  APL  of  BDDs  optimized  using  four  different  methods:  (a)  exact  minimization  of 
the  number  of  nodes;  (b)  exact  minimization  of  the  APL;  (c)  the  algorithm  in  [16];  and  (d)  the  heuristic  APL  minimization 
algorithm  presented  in  this  paper.  In  the  table,  Name  lists  the  names  of  benchmark  functions.  In  and  Out  lists  the  numbers 
of  input  variables  and  single-output  functions,  respectively.  Columns  Nodes  contain  the  number  of  non-terminal  nodes. 
Columns  Time  contain  the  CPU  time  of  three  algorithms  coded  by  us,  in  seconds.  Unfortunately,  the  CPU  time  of  the 
algorithm  in  [16]  is  unavailable.  Columns  “(a)  MinJModes”,  “(b)  Min_APL”,  “(c)  Liu  [16]”,  and  “(d)  sifting”  show  the  exact 
nodes  minimization  algorithm  in  [9],  the  exact  APL  minimization  algorithm  in  Section  4.3,  the  heuristic  APL  minimization 
in  [16],  and  the  heuristic  APL  minimization  in  Section  4.4,  respectively.  Initial  variable  order  for  “(d)  sifting”  was  obtained 
using  Walsh  spectrum  described  in  Section  4.5.  The  BDDs  in  this  table  use  complemented  edges.  Table  1  includes  the 
same  benchmark  functions  as  the  experiment  in  [16]  except  for  incompletely  specified  functions.  We  omitted  incompletely 
specified  functions  because  the  number  of  nodes  and  the  APL  in  BDDs  for  incompletely  specified  functions  depend  on  the 


Table  1 :  Minimization  of  APL  for  individual  BDDs 


Name 

In 

Out 

(a)  Min_Nodes 

(b)  Min_APL 

(c)  Liu  [16] 

(d)  sifting 

Nodes 

APL 

Time 

Nodes 

APL 

Time 

Nodes 

APL 

Nodes 

APL 

Time 

7 

10 

66 

34.13 

0.01 

81 

31.28 

0.01 

91 

31.31 

79 

31.28 

0.01 

feinEBail 

14 

8 

448 

41.75 

22.76 

547 

39.69 

28.71 

899 

47.54 

516 

39.97 

0.01 

bl2 

15 

9 

64 

23.86 

0.03 

68 

21.84 

0.01 

81 

22.22 

71 

21.88 

0.01 

coni 

7 

2 

14 

6.06 

0.01 

16 

5.94 

0.01 

16 

6.06 

16 

5.94 

0.01 

cordic 

23 

2 

73 

13.74 

416.57 

89 

9.43 

1006.08 

259 

11.82 

88 

9.47 

0.01 

sao2 

10 

4 

99 

10.90 

0.26 

116 

10.59 

0.06 

128 

10.71 

121 

10.59 

0.01 

vg2 

25 

8 

202 

31.00 

6431.83 

222 

29.91 

376.78 

230 

30.37 

204 

30.16 

0.01 

misex  1 

8 

7 

54 

23.22 

0.01 

57 

21.97 

0.02 

68 

22.16 

64 

21.97 

0.01 

cm  150a 

21 

1 

32 

3.50 

1106.23 

32 

3.50 

1510.58 

33 

3.50 

32 

3.50 

0.01 

cm  151a 

12 

2 

32 

6.00 

0.38 

32 

6.00 

0.28 

36 

6.50 

32 

6.00 

0.01 

cm  1 62a 

14 

5 

41 

11.76 

0.06 

52 

11.70 

0.05 

59 

11.70 

48 

11.71 

0.01 

cm  163a 

16 

5 

35 

11.70 

0.01 

38 

11.70 

0.01 

42 

11.70 

36 

11.70 

0.01 

cm85a 

11 

3 

38 

7.72 

0.05 

38 

7.72 

0.01 

47 

8.28 

38 

7.72 

0.01 

mux 

21 

1 

32 

3.50 

1098.72 

32 

3.50 

1410.57 

33 

3.50 

32 

3.50 

0.01 

z4ml 

7 

28 

18.25 

0.01 

30 

16.38 

0.02 

32 

17.13 

28 

16.38 

0.01 

f51m 

8 

51 

28.08 

0.01 

65 

27.33 

0.02 

76 

27.45 

64 

27.45 

0.01 

pcle 

19 

79 

22.50 

0.11 

84 

22.50 

0.03 

89 

22.50 

79 

22.50 

0.01 

Average  of  ratios 

1.00 

1.00 

1.00 

1.12 

0.95 

0.93 

1.40 

0.99 

1.10 

0.95 

0.40 
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Table  2:  Minimization  of  APL  for  shared  BDDs  for  larger  functions 


Name 

In 

Out 

classical  sifting 

Coef. 

Time 

Without  Walsh  spectrum 

With  Walsh  spectrum 

Nodes 

APL 

Nodes 

APL 

Time 

Nodes 

APL 

Time 

C432 

36 

7 

1063 

86.58 

0.01 

1081 

86.24 

0.15 

1899 

82.09 

0.83 

C499 

41 

32 

25873 

782.66 

0.02 

32105 

641.16 

7.12 

32105 

641.16 

7.11 

C880 

60 

26 

4122 

140.42 

0.01 

41701 

123.85 

4.48 

91767 

122.22 

52.12 

C1908 

33 

25 

5532 

254.65 

0.01 

16634 

179.20 

0.96 

13868 

171.96 

2.73 

C2670 

233 

140 

1882 

303.34 

0.05 

2755 

278.17 

1.30 

* 

* 

* 

C3540 

50 

22 

24231 

209.15 

0.10 

25162 

208.44 

7.44 

56898 

212.73 

75.21 

C5315 

178 

123 

1728 

460.78 

0.05 

1820 

446.26 

0.26 

* 

* 

* 

C7552 

207 

108 

2212 

485.03 

0.05 

2207 

471.54 

0.87 

* 

* 

* 

apex3 

54 

50 

931 

188.58 

0.01 

900 

158.82 

0.04 

905 

158.73 

0.03 

apex7 

49 

37 

242 

113.88 

0.01 

277 

82.44 

0.01 

280 

82.45 

0.02 

b9 

41 

21 

108 

61.16 

0.01 

131 

55.25 

0.01 

129 

55.39 

0.01 

dalu 

75 

16 

688 

102.67 

0.01 

990 

78.81 

0.08 

1069 

78.81 

35.31 

des 

256 

245 

3297 

1209.50 

0.18 

3343 

1081.13 

0.47 

3886 

1077.63 

2.15 

duke  2 

22 

29 

360 

87.89 

0.01 

386 

77.52 

0.01 

392 

77.52 

0.02 

e64 

65 

65 

128 

128.00 

0.01 

128 

128.00 

0.01 

573 

128.00 

0.05 

ex4 

128 

28 

497 

51.38 

0.01 

629 

47.26 

0.02 

630 

47.26 

0.03 

frg2 

143 

139 

1379 

607.00 

0.04 

1580 

322.89 

0.15 

2189 

321.75 

0.23 

k2 

45 

45 

1257 

181.80 

0.01 

1426 

177.52 

0.07 

1418 

177.50 

0.10 

rot 

135 

107 

7891 

446.47 

0.05 

16164 

312.08 

5.61 

18503 

308.68 

30.34 

Average 

1.00 

1.00 

0.03 

1.87 

0.85 

1.53 

3.01 

0.84 

12.89 

*  Memory  overflow  precluded  computation  of  these  values. 
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assignment  of  values  to  don’t  cares,  as  well  as  the  variable  order.  To  make  our  results  compatible  with  the  results  in  [16],  we 
optimized  each  output  of  the  multiple-output  benchmark  functions  independently,  and  obtained  the  sum  of  the  values  over 
all  outputs.  Thus,  the  number  of  nodes  and  APL  in  Table  1  are  different  from  those  of  the  SBDD.  Two  rounds  of  sifting  are 
performed  in  all  experiments.  The  row  labeled  Average  of  ratios  represents  the  normalized  averages  for  Nodes,  APL,  and 
Time  assuming  the  values  of  “(a)  Min_Nodes”  to  be  1.00.  The  columns  “(b)  Min_APL”,  “(c)  Liu  [16]”,  and  “(d)  sifting”  of 
this  row  contains  the  relative  values  to  the  results  of  “(a)  Min_Nodes”. 

The  heuristic  method  in  [16]  obtained  BDDs  with  the  exact  minimum  APLs  in  5  out  of  17  benchmark  functions.  How¬ 
ever,  for  alu4,  cml51a,  and  cm85a,  the  algorithm  in  [16]  obtained  BDDs  with  much  larger  APLs  than  the  exact  minimum 
APLs.  On  the  other  hand,  our  heuristic  method  in  Section  4.4  obtained  BDDs  with  the  exact  minimum  APLs  in  1 1  out  of 
17  benchmark  functions.  For  five  of  the  remaining  functions,  the  APLs  in  the  column  labeled  “(d)  sifting”  are  smaller  than 
or  equal  to  the  APLs  in  “(c)  Liu  [16]”.  For  cml62a,  our  sifting  algorithm  obtained  BDDs  with  slightly  larger  APLs  than  the 
exact  minimum  APLs. 

An  exhaustive  search  algorithm  finds  the  minimum  APLs  for  the  functions  with  up  to  14  inputs  within  a  reasonable 
computation  time.  Meanwhile,  our  exact  minimization  algorithm  in  Section  4.3  found  the  minimum  APL  for  functions  with 
25  inputs  (vg2)  within  a  reasonable  computation  time. 

Table  2  shows  the  results  for  larger  MCNC  benchmarks  and  the  effectiveness  of  the  initial  variable  order  using  the  Walsh 
spectrum.  In  this  table,  we  used  SBDDs  with  complemented  edges  for  multiple-output  functions.  In  Table  2,  the  column 
“classical  sifting”  shows  the  number  of  nodes  and  APL  for  BDDs  obtained  by  the  sifting  algorithm  [23]  which  minimizes 
the  number  of  nodes  in  BDD.  The  column  “Without  Walsh  spectrum”  shows  the  results  of  our  sifting  algorithm,  which 
minimizes  the  APL,  where  the  initial  variable  orders  are  the  variable  orders  of  BDDs  obtained  by  “classical  sifting”.  And, 
the  column  “With  Walsh  spectrum”  shows  the  results  of  our  sifting  algorithm,  where  the  initial  variable  orders  were  obtained 
using  Walsh  spectrum  shown  in  Section  4.5.  The  column  “Coef.  Time”  denotes  the  CPU  time  needed  to  calculate  the  values 
of  first-order  Walsh  spectral  coefficients  R,,  in  seconds.  Unfortunately,  for  C2670,  C5315,  and  C7552,  BDDs  with  the  initial 
variable  orders  could  not  be  constructed  due  to  memory  overflow.  The  row  labeled  Average  represents  average  of  Time  and 
normalized  averages  of  Nodes  and  APL  assuming  the  values  of  “classical  sifting”  to  be  1.00.  The  columns  “Without  Walsh 
spectrum”  and  “With  Walsh  spectrum”  show  the  relative  values  to  the  results  of  “classical  sifting”. 

For  some  benchmark  functions,  for  example,  C1908,  frg2,  and  rot,  the  APLs  are  reduced  drastically.  For  C7552,  the 
number  of  nodes  is  reduced  as  a  byproduct  of  the  APL  minimization.  However,  for  most  functions,  the  number  of  nodes 
is  increased  by  the  APL  minimization.  The  comparison  of  “Without  Walsh  spectrum”  and  “With  Walsh  spectrum”  shows 
the  effectiveness  of  the  initial  variable  order  using  Walsh  spectrum.  For  8  out  of  19  benchmark  functions,  the  APLs  in  the 
column  “With  Walsh  spectrum”  are  smaller  than  the  APLs  in  “Without  Walsh  spectrum”.  The  computation  time  to  calculate 
the  values  of  P,  is  short. 

However,  for  most  functions,  the  computation  times  of  sifting  for  “With  Walsh  spectrum”  are  significantly  longer  than 
that  for  “Without  Walsh  spectrum”  because  the  number  of  nodes  in  BDD  with  initial  variable  order  computed  using  Walsh 
spectrum  is  large.  When  the  number  of  nodes  in  the  BDD  is  large,  swapping  one  pair  of  adjacent  variables  takes  a  longer 
time  because  the  time  needed  for  the  swap  is  roughly  proportional  to  the  number  of  nodes  present  on  the  given  levels  in  the 
BDD. 

Tables  1  and  2  show  that  the  proposed  heuristic  minimization  minimizes  the  APL  in  short  computation  time.  For  small 
benchmark  functions  in  Table  1,  the  heuristic  minimization  could  obtain  BDDs  with  near-minimum  APLs.  For  large  bench¬ 
mark  functions  in  Table  2,  the  heuristic  algorithm  reduces  APLs  to  84%  on  the  average. 


6.2  Multi-Valued  Case 

There  are  no  standard  benchmark  functions  for  multi-valued  logic.  Thus,  by  pairing  binary  variables  of  2-valued  benchmark 
functions,  we  obtained  4- valued  input  2-valued  output  functions.  Table  3  compares  the  number  of  nodes  and  the  APLs  of  the 
BDDs  and  the  MDDs  optimized  using  four  algorithms:  (a)  exact  minimization  of  the  APL  for  BDDs;  (b)  exact  minimization 
of  the  number  of  nodes  for  MDDs;  (c)  exact  minimization  of  the  APL  for  MDDs;  and  (d)  the  heuristic  APL  minimization 
algorithm  for  MDDs.  Columns  “BDD”,  “MinJModes”,  “Min_APL”,  and  “pair- sifting”  denote  the  exact  APL  minimization 
algorithm  in  Section  4.3,  the  exact  paired  ordering  algorithm  for  node  minimization,  the  exact  paired  ordering  algorithm  for 
APL  minimization  in  Section  5.1,  and  the  pair-sifting  algorithm  in  Section  5.2,  respectively.  The  symbol  in  this  table 
denotes  that  the  results  could  not  be  obtained  because  of  memory  overflow.  The  bottom  row  labeled  Average  represents 
normalized  averages  of  Nodes,  APL,  and  Time  for  all  functions  except  for  4  functions  ( cordic ,  cml50a,  mux,  pcle),  where 
the  values  of  “BDD”  are  set  to  1 .00.  The  SBDDs  and  SMDDs  in  this  table  do  not  use  complemented  edges.  Note  that  the 
values  (Nodes,  APL,  Time)  of  BDDs  in  this  table  are  different  from  the  values  in  Table  1,  because  in  this  table,  SBDDs 


452  N AG AYAM  A,  et  al. 


Table  3:  Comparison  of  APLs  for  SBDDs  and  SMDDs 


BDD 

MDD 

Min_Nodes 

Min_APL 

pair-sifting 

Name 

Nodes 

APL 

Time 

Nodes 

APL 

Time 

Nodes 

APL 

Time 

Nodes 

APL 

Time 

hM£Slg||| 

68 

32.00 

0.01 

37 

20.69 

0.01 

37 

20.69 

0.02 

37 

20.69 

0.01 

fen  ra  Mill 

462 

40.70 

47.96 

247 

27.49 

77.46 

290 

23.26 

101.64 

393 

27.00 

0.01 

bl2 

66 

22.77 

2.26 

39 

22.84 

387.21 

50 

16.83 

5.11 

55 

19.00 

0.01 

coni 

18 

6.31 

0.01 

11 

4.63 

0.01 

11 

4.06 

0.01 

13 

5.69 

0.01 

cordic 

94 

9.46 

3996 

* 

* 

* 

43 

5.21 

3587 

70 

5.81 

0.01 

sao2 

102 

10.64 

0.12 

46 

9.98 

0.21 

58 

6.57 

0.07 

55 

6.79 

0.01 

misexl 

46 

22.84 

0.02 

22 

15.50 

0.01 

28 

12.75 

0.01 

30 

15.41 

0.01 

cm  150a 

32 

3.50 

5972 

* 

* 

* 

13 

2.25 

19509 

26 

2.47 

0.01 

cml51a 

32 

6.00 

0.47 

16 

5.50 

0.13 

22 

4.00 

0.80 

22 

4.00 

0.01 

cm  162a 

38 

11.70 

0.86 

19 

8.73 

0.55 

27 

8.44 

0.56 

23 

8.66 

0.01 

cm  163a 

40 

11.70 

1.98 

19 

12.25 

19.60 

22 

8.16 

1.11 

19 

8.16 

0.01 

cm85a 

37 

7.72 

0.08 

15 

6.28 

0.08 

16 

5.38 

0.16 

25 

5.84 

0.01 

mux 

32 

3.50 

6334 

* 

* 

* 

13 

2.25 

13811 

26 

2.47 

0.01 

z4ml 

26 

16.38 

0.01 

10 

9.13 

0.01 

10 

9.13 

0.01 

10 

9.13 

0.01 

f51m 

76 

28.02 

0.03 

39 

18.81 

0.02 

41 

17.38 

0.03 

41 

18.81 

0.01 

pcle 

83 

22.50 

34.46 

* 

* 

* 

50 

16.50 

29.56 

48 

20.92 

0.01 

Average 

1.00 

1.00 

1.00 

0.50 

0.79 

15.89 

0.59 

0.64 

1.28 

0.63 

0.70 

0.34 

*  Memory  overflow  precluded  computation  of  these  values. 
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Table  4:  Minimization  of  APL  for  SMDDs  for  larger  functions 


Name 

Node  pair-sifting  [24] 

APL  pair-sifting 

Nodes 

APL 

Time 

Nodes 

APL 

Time 

C432 

617 

59.84 

0.03 

721 

58.75 

0.15 

C499 

13541 

407.23 

1.52 

16397 

339.73 

9.23 

C880 

3025 

118.99 

0.30 

34730 

107.89 

9.35 

Cl  908 

4390 

167.42 

0.55 

15287 

124.36 

1.49 

C2670 

2336 

276.19 

0.31 

3945 

260.65 

1.99 

C3540 

22519 

155.06 

7.33 

24241 

157.57 

19.16 

C5315 

1947 

398.53 

0.23 

2258 

393.20 

0.49 

C7552 

2292 

420.69 

0.45 

2236 

431.77 

1.86 

apex3 

628 

143.66 

0.03 

694 

96.72 

0.06 

apex7 

200 

99.82 

0.02 

257 

73.29 

0.01 

b9 

126 

55.90 

0.01 

173 

51.66 

0.02 

dalu 

523 

70.55 

0.03 

644 

42.93 

0.13 

des 

2685 

934.38 

0.55 

2994 

911.44 

1.04 

duke2 

272 

65.99 

0.02 

288 

51.50 

0.01 

e64 

96 

96.67 

0.01 

993 

86.44 

0.03 

ex4 

420 

46.00 

0.02 

482 

39.45 

0.05 

frg2 

1179 

544.34 

0.10 

1457 

260.11 

0.20 

k2 

1055 

168.17 

0.05 

912 

108.42 

0.11 

rot 

5615 

393.57 

1.29 

14898 

284.22 

9.52 

Average 

1.00 

1.00 

1.00 

2.40 

0.83 

4.66 

without  complemented  edges  are  used. 

The  pair-sifting  algorithm  obtained  MDDs  with  the  exact  minimum  APL  for  4  functions.  On  the  average,  the  pair¬ 
sifting  algorithm  reduced  the  APL  to  70%  of  “BDD".  For  coni,  the  pair-sifting  algorithm  obtained  larger  APL  than  that  of 
“Min_Nodes”  due  to  heuristic  pairing  algorithm.  However,  this  algorithm  can  obtain  a  smaller  APL  and  fewer  nodes  than 
those  of  the  corresponding  BDD.  Although  the  exact  paired  ordering  algorithms  for  nodes  and  APL  can  reduce  both  nodes 
and  APL  drastically,  they  are  time-consuming.  On  the  other  hand,  the  pair-sifting  algorithm  quickly  reduces  both  Nodes 
and  APL. 

Table  4  shows  the  results  for  larger  MCNC  benchmark  functions.  Similarly,  we  obtained  4-valued  input  2-valued  output 
functions  by  pairing  binary  variables.  Column  “Node  pair-sifting  [24]”  denotes  the  heuristic  paired  ordering  algorithm  for 
the  node  minimization  method  proposed  in  [24].  The  number  of  nodes  in  an  MDD  obtained  by  the  paired  ordering  algorithm 
for  node  minimization  is  smaller  than  or  equal  to  the  corresponding  BDD  [24],  However,  since  the  MDDs  in  this  table  do 
not  use  the  complemented  edges,  some  MDDs  are  larger  than  the  BDDs  with  complemented  edges  in  Table  2.  The  bottom 
row  labeled  Average  represents  normalized  averages  of  Nodes,  APL,  and  Time  assuming  the  values  of  “Node  pair-sifting 
[24]”  to  be  1.00. 

For  these  benchmark  functions,  the  pair-sifting  algorithm  reduced  the  APL  to  83%  of  “Node  pair-sifting  [24]”,  on 
average.  Especially,  for  frg2,  the  APL  was  reduced  to  48%  of  “Node  pair-sifting  [24]”.  The  pair-sifting  algorithm  cannot 
always  find  an  MDD  with  the  minimum  APL,  because  it  is  a  heuristic  algorithm.  For  C3540  and  C7552,  the  APLs  are 
slightly  larger  than  that  in  “Node  pair-sifting  [24]”.  However,  Tables  2  and  4  show  that  the  pair-sifting  algorithm  can  find 
an  MDD  with  smaller  APL  than  APL  of  corresponding  BDD. 


7  CONCLUSION  AND  COMMENTS 

We  have  proposed  an  exact  and  a  heuristic  algorithm  for  the  minimization  of  the  APL  in  BDDs  and  MDDs.  The  experimental 
results  using  MCNC  benchmark  functions  show  that:  1)  The  exact  minimization  algorithm  finds  BDDs  with  the  minimum 
APL  for  the  function  with  up  to  25  input  variables  within  a  reasonable  computation  time.  2)  Using  the  node  and  edge 
traversing  probabilities  to  compute  and  update  the  APLs  after  the  swap  of  two  adjacent  variables,  the  proposed  sifting 
algorithm  can  heuristically  minimize  the  APLs  as  fast  as  classical  sifting,  which  minimizes  the  number  of  nodes.  3)  Using 
an  initial  variable  order  computed  using  Walsh  spectral  coefficients  increases  the  quality  of  the  results  of  APL  minimization 
algorithms.  However,  in  some  cases  the  initial  variable  order  leads  to  BDDs  with  a  large  number  of  nodes,  which  slows  down 
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APL  minimization.  4)  MDDs  produced  by  pairing  binary  variables  have  smaller  APL  and  fewer  nodes  than  corresponding 
BDDs. 
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APPENDIX 

Proof  for  Lemma  2.1.  We  prove  only  the  first  statement;  the  proof  for  the  second  statement  is  similar.  Consider  a  node  v. 
Any  path  that  includes  an  incoming  edge  to  v  includes  v.  Conversely,  any  path  that  includes  v  includes  an  incoming  edge 
to  v.  It  follows  that  any  assignment  of  values  to  the  variables  that  corresponds  to  a  path  through  v  contributes  to  the  node 
traversing  probability  of  v  an  amount  that  is  identical  to  the  amount  contributed  to  the  edge  traversing  probability  of  an 
incoming  edge  to  v.  It  follows  that  the  node  traversing  probability  of  v  is  equal  to  the  sum  of  edge  traversing  probabilities 
of  all  incoming  edges  to  v.  □ 

Proof  for  Theorem  2.1.  We  prove  only  the  first  statement;  the  proof  for  the  second  statement  is  similar.  From  Definition  2.6, 
we  have 

ETP(e)  =  X  pp(P)> 

peSP(e) 


(1) 
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where  SP(e)  is  a  set  of  paths  including  the  edge  e.  We  prove  the  following 

Ne 

APL=  ^ETP(ei),  (2) 

i=  1 

where  Ne  denotes  the  number  of  edges  in  a  DD.  From  (1),  (2)  can  be  transformed  as  follows: 

Ne 

APL  =  X ETP(ei ) 

i=  1 

=  £  I  PP{P)  (3) 

i=l  peSP(ei) 

From  Definition  2.4,  we  have 

N 

APL  =  X Pp(P<)xli 
1=1 

=  £  I  PP(Pi)  (4) 

«=U=  1 

Although  (3)  and  (4)  use  different  computational  approaches,  they  obviously  compute  the  same  value.  □ 

Proof  for  Lemma  3.1.  An  SMDD  for  F  =  (/o,/t ,  •  •  •  ,/m-t)  is  traversed  from  a  root  node  to  a  terminal  node  777  times  to 
evaluate  multiple-output  function  F.  Since  mu  root  nodes  are  located  above  or  in  level  i,  mu  traversals  via  edges  in  Cut ( i) 
are  performed  while  evaluating  the  multiple-output  function.  Therefore,  we  have  ETP(Cut(i))  =  mu-  □ 

Proof  for  Lemma  3.2.  From  Lemma  2.1,  the  following  relation  holds: 

ETP{Cut'{i))  =  Yj  NTP(v), 

vevc 

where  Vc  denotes  a  set  of  non-terminal  nodes  representing  the  cofactors  with  respect  to  Xupper.  The  probability  of  the 
occurrence  of  the  cofactor  depends  only  on  the  function  and  not  the  order  of  Xupper.  Since  Cut'  (i)  does  not  include  the 
edges  to  terminal  nodes,  the  upper  bound  of  mu  on  c,  follows  from  Lemma  3.1.  □ 

Proof  for  Theorem  3.1.  All  nodes  representing  cofactors  with  respect  to  the  variables  in  Xupper  and  /;//  root  nodes  are 
situated  below  or  in  level  i  +  1 .  Thus,  L  includes  the  node  traversing  probabilities  of  these  nodes.  □ 

Proof  for  Theorem  3.2.  Let  L  be  the  sum  of  the  node  traversing  probabilities  of  the  non-terminal  nodes  below  or  in  level 
i  +  1 .  From  Theorem  2. 1 ,  we  have 

APL=U+L. 

Then,  from  Theorem  3.1,  for  any  permutation  of  Xiower, 

APL  >  U  +  ETP(Cut'(i))  +  mL. 


□ 

Proof  for  Theorem  4.1.  The  variable  swap  of  level  i  and  level  i  +  1  does  not  influence  the  graph  structure  except  for  levels 
7  and  i  +  1  because  of  the  locality  of  the  swap  operation.  Thus,  it  is  clear  that  U  remains  unchanged.  From  Lemma  2.1,  L  is 
obtained  by  the  sum  of  ETP(Cut'(i  +  1))  and  ETP(Eiower),  where 

Cut'(i+  l)  =  {e\e£  Cut(i+  1),  e  is  incident  to  a  non-terminal  node}, 

E iOWer  =  {e  |  e  is  an  edge  situated  below  or  in  level  i  +  2}, 

ETP(Cut\i+ 1))  =  £  ETP(e), 

e£Cut'(i+ 1) 

ET  P{Elower)  =  X  ETP(e). 

lower 
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By  Lemma  3.2,  ETP(Cut'(i+  1))  is  an  invariant.  ETP(Eiower)  remains  unchanged  because  of  the  invariance  of  ETP(Cut'  (i+ 
1))  and  the  locality  of  the  swap  operation.  Therefore,  L  also  remains  unchanged.  □ 

Proof  for  Theorem  5.1.  The  number  of  different  permutations  of  binary  variables  X  is  n\.  Since  from  Definition  5.1,  the 
binary  variables  X  are  partitioned  into  the  unordered  sets  {Xi},  {X2}, . . . ,  {X„},  the  order  of  binary  variables  in  each  {X,} 
is  not  important.  The  number  of  different  permutations  of  two  binary  variables  in  each  {X,}  is  2.  Therefore,  we  have  the 
theorem.  □ 


